feat: add Qwen3-Omni Thinker GSPO support #6238

Draft

qinganrice wants to merge 4 commits into verl-project:main from qinganrice:qwen3-omni-thinker-v2

Conversation

@qinganrice

Summary

  • Register the Qwen3-Omni model in AutoModelForCausalLM with a forward redirect to the Thinker, and fix tie_word_embeddings and _no_split_modules for FSDP compatibility (sketch below)
  • Fix an FSDP LoRA deadlock: skip the lambda wrap policy when min_num_params > 0 to avoid divergent nested-FSDP allgathers (sketch below)
  • Cast LoRA parameters to the base model's dtype after get_peft_model so FSDP can flatten mixed-dtype units (sketch below)
  • Strip unused sub-modules (Talker/Code2Wav) after from_pretrained via _verl_strip_modules (sketch below)
  • Add Thinker layer prefixes to layered_summon, with a fallback to a full summon when the layered pass returns empty (sketch below)
  • Fix the text_config fallback in monkey_patch for models without a top-level num_attention_heads (sketch below)
  • Duck-type the vLLM LoRA request check to support vllm-omni's LoRARequest (sketch below)
  • Add a gsm8k_thinker reward with </think> extraction and \boxed{} support (sketch below)
  • Register vllm_omni / vllm_omni_ar in the rollout and replica registries for verl-omni integration
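
A minimal sketch of the AutoModelForCausalLM registration and Thinker redirect. The transformers class names (Qwen3OmniMoeConfig, Qwen3OmniMoeForConditionalGeneration) follow the library's naming convention but are not confirmed from this diff, and the decoder-layer name in _no_split_modules is a placeholder guess:

```python
from transformers import (
    AutoModelForCausalLM,
    Qwen3OmniMoeConfig,
    Qwen3OmniMoeForConditionalGeneration,
)


class Qwen3OmniThinkerForCausalLM(Qwen3OmniMoeForConditionalGeneration):
    # Keep decoder layers unsplit so FSDP wraps each one as a whole unit;
    # the class name listed here is a guess, not taken from the PR.
    _no_split_modules = ["Qwen3OmniMoeThinkerTextDecoderLayer"]

    def __init__(self, config):
        # The Thinker keeps its own lm_head, so don't tie it to the input
        # embeddings (on composite configs the flag's location may differ).
        config.tie_word_embeddings = False
        super().__init__(config)

    def forward(self, *args, **kwargs):
        # Delegate to the Thinker (the text-reasoning sub-model); the Talker
        # and Code2Wav heads are never used during GSPO training.
        return self.thinker(*args, **kwargs)

    def get_input_embeddings(self):
        return self.thinker.get_input_embeddings()

    def get_output_embeddings(self):
        return self.thinker.get_output_embeddings()


AutoModelForCausalLM.register(
    Qwen3OmniMoeConfig, Qwen3OmniThinkerForCausalLM, exist_ok=True
)
```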
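A sketch of the deadlock guard, assuming verl selects among torch's FSDP auto-wrap policies roughly like this (get_wrap_policy and its arguments are illustrative, not verl's exact signature):

```python
import functools

from torch.distributed.fsdp.wrap import (
    lambda_auto_wrap_policy,
    size_based_auto_wrap_policy,
)


def get_wrap_policy(min_num_params: int, is_lora: bool):
    if min_num_params > 0:
        # Size-based wrapping only. Also installing the LoRA lambda policy made
        # ranks disagree on FSDP unit nesting and hang in allgather.
        return functools.partial(
            size_based_auto_wrap_policy, min_num_params=min_num_params
        )
    if is_lora:
        def lambda_fn(module):
            # Give each trainable leaf (e.g. a LoRA A/B linear) its own FSDP unit.
            return (
                len(list(module.named_children())) == 0
                and getattr(module, "weight", None) is not None
                and module.weight.requires_grad
            )
        return functools.partial(lambda_auto_wrap_policy, lambda_fn=lambda_fn)
    return None
```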
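The dtype cast after get_peft_model, sketched with PEFT's public API (build_lora_model is an illustrative helper, not a verl function):

```python
from peft import LoraConfig, get_peft_model


def build_lora_model(base_model, lora_config: LoraConfig):
    base_dtype = next(base_model.parameters()).dtype  # e.g. torch.bfloat16
    peft_model = get_peft_model(base_model, lora_config)
    for name, param in peft_model.named_parameters():
        # Freshly initialized lora_A/lora_B weights default to fp32; FSDP cannot
        # flatten an fp32 adapter and a bf16 base weight into one flat parameter.
        if "lora_" in name and param.dtype != base_dtype:
            param.data = param.data.to(base_dtype)
    return peft_model
```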
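A sketch of the stripping hook; _verl_strip_modules is the PR's name, but the attribute names ("talker", "code2wav") and the Identity-stub approach are assumptions:

```python
import torch.nn as nn


def _verl_strip_modules(model: nn.Module, names=("talker", "code2wav")) -> nn.Module:
    """Drop sub-models that GSPO never touches so FSDP doesn't shard their weights."""
    for name in names:
        if hasattr(model, name):
            # An Identity stub keeps attribute access valid while freeing the
            # weights (the originals are released once nothing references them).
            setattr(model, name, nn.Identity())
    return model
```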
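A rough sketch of the layered-summon idea with a full-summon fallback; the prefixes, the name-cleaning step, and collect_lora_weights are all illustrative simplifications of whatever layered_summon actually does:

```python
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP

THINKER_PREFIXES = ("thinker.model.layers.", "thinker.visual.", "thinker.audio_tower.")


def collect_lora_weights(fsdp_model):
    collected = {}
    for name, module in fsdp_model.named_modules():
        clean = name.replace("_fsdp_wrapped_module.", "")  # drop FSDP wrapper segments
        # Match exactly one block per prefix (e.g. thinker.model.layers.0) so only
        # that block's shards are gathered at a time, bounding peak memory.
        if any(clean.startswith(p) and clean.count(".") == p.count(".")
               for p in THINKER_PREFIXES):
            with FSDP.summon_full_params(module, writeback=False):
                for pname, param in module.named_parameters():
                    if "lora_" in pname:
                        collected[f"{clean}.{pname}"] = param.detach().cpu()
    if not collected:
        # Layered matching found nothing (unexpected layout): summon the whole
        # model in one pass instead.
        with FSDP.summon_full_params(fsdp_model, writeback=False):
            collected = {n: p.detach().cpu()
                         for n, p in fsdp_model.named_parameters() if "lora_" in n}
    return collected
```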
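The text_config fallback pattern, sketched generically (get_text_config_attr is a hypothetical helper):

```python
def get_text_config_attr(config, name: str):
    # Multimodal wrapper configs (like Qwen3-Omni's) nest the LM settings under
    # config.text_config instead of exposing them at the top level.
    if hasattr(config, name):
        return getattr(config, name)
    return getattr(config.text_config, name)


# e.g. num_heads = get_text_config_attr(model.config, "num_attention_heads")
```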
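The duck-typing idea in one function; the three attribute names are vLLM's public LoRARequest fields, and is_lora_request is an illustrative name:

```python
def is_lora_request(obj) -> bool:
    # Checking attributes instead of isinstance against vLLM's LoRARequest class
    # lets vllm-omni's separate LoRARequest type pass through unchanged.
    return all(hasattr(obj, a) for a in ("lora_name", "lora_int_id", "lora_path"))
```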
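A sketch of the gsm8k_thinker extraction logic under the usual GSM8K conventions; the exact regexes in the PR may differ (review feedback below notes, e.g., currency-symbol handling):

```python
import re


def extract_answer(response: str) -> str | None:
    # Keep only the text after the final </think> tag, i.e. the model's answer.
    answer_part = response.rsplit("</think>", 1)[-1]
    # Prefer a \boxed{...} answer (this sketch handles non-nested braces only).
    boxed = re.findall(r"\\boxed\{([^{}]*)\}", answer_part)
    if boxed:
        return boxed[-1].strip().replace(",", "")
    # Otherwise fall back to the last number in the answer, GSM8K-style.
    numbers = re.findall(r"-?\d[\d,]*\.?\d*", answer_part)
    return numbers[-1].replace(",", "") if numbers else None


def compute_score(solution_str: str, ground_truth: str) -> float:
    pred = extract_answer(solution_str)
    return 1.0 if pred is not None and pred == ground_truth.strip() else 0.0
```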

Test plan

  • End-to-end GSPO LoRA training with the Qwen3-Omni Thinker model

@CLAassistant

CLAassistant commented May 4, 2026

CLA assistant check
All committers have signed the CLA.

Contributor

@gemini-code-assist (bot) left a comment


Code Review

This pull request introduces support for the Qwen3-Omni model architecture and enhances FSDP and LoRA handling. Key changes include registering the Qwen3-Omni Thinker as a causal language model with custom forward and embedding logic, implementing a module stripping mechanism to reduce memory usage during FSDP initialization, and adding a new reward scoring utility (gsm8k_thinker) designed for models that output reasoning steps. Additionally, the PR updates LoRA parameter collection to support diffusers and adds a fallback mechanism for parameter summoning. Review feedback highlights the need to narrow broad architecture mappings to prevent conflicts with encoder-decoder models, improve exception handling during model registration, refine regex patterns in the reward scorer to handle currency symbols, and remove debug print statements from production code.

Review comment threads (outdated)

  • verl/utils/model.py (two threads)
  • verl/utils/reward_score/gsm8k_thinker.py
  • verl/workers/engine/fsdp/transformer_impl.py
